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Abstract 

In this paper, we analyze a multiple-input multiple-output (MIMO) interference channel where 
nodes are randomly distributed on a plane as a spatial Poisson cluster point process. Each cluster uses 
interference alignment (lA) to suppress intra-cluster interference but unlike most work on lA, we do 
not neglect inter-cluster interference. We also connect the accuracy of channel state information to the 
distance between the nodes, i.e. the quality of CSI degrades with increasing distance. Accounting for the 
training and feedback overhead, we derive the transmission capacity of this MIMO lA ad hoc network 
and then compare it to open-loop (interference-blind) spatial multiplexing. Finally, we present exemplary 
system setups where spatial multiplexing outperforms lA due to the imperfect channel state information 
or the non-aligned inter-cluster interference. 

I. Introduction 

Interference alignment (lA) acliieves the degrees of freedom in tlie i^-user MIMO interference 
cliannel [[2|. lA confines tlie interference to a subspace at each receiver such that an interference- 
free subspace becomes available for the desired signal transmission. Except for blind lA tech- 
niques [[3j, [|4], which usually attain a lower multiplexing gain, lA requires cooperation between 
the transmitting/receiving nodes using global channel state information (CSI) [5| or some form 
of channel reciprocity. Although the impact of inaccurate CSI on the performance of lA has 
been studied before [|6|, existence of non-cooperating nodes casting non-aligned interference has 
been mostly ignored. 

A. Background 

In large networks, such as mobile ad hoc and cellular networks, lA will be used independently 
in separate clusters and the nearby nodes that are not coordinating with any one cluster will cause 
non-aligned interference at the receivers. There are three main reasons. 

1) Number of antennas at each node is a limiting factor It is shown in [|2| that the number of 
nodes that can cooperate through MIMO lA is limited by the number of antennas at each 
node. 

2) Overhead practically limits the cluster size. The overhead of lA grows super linearly with 
the number of users [|7|, and hence it is likely that small groups of nodes will coordinate 
to perform lA. 
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3) More cooperation is not always better. Recently, [[8| showed that because of inherent channel 
uncertainty, there is a moderate cluster size above which spectral efficiency at best saturates, 
and in many practical scenarios (e.g. when pilots are used for channel estimation), actually 
decreases if more nodes join the cluster to cooperate. 
In this case, single cluster analysis (e.g. DoF studies) does not capture the impact of interference 
from the other nodes in the network and can lead to unrealistic cooperation gains which would 
not be attainable if the inter-cluster interference was accounted for [[8|, [j9|. Most work on the 
performance of lA, however, is confined to single-cluster performance analysis (see 1 10 1-| 12 1 
and references therein). 

When dealing with large networks, a relevant metric of the system performance is the trans- 
mission capacity [13|, defined as the number of successful transmission per unit area, subject 
to a constraint on outage probability. The transmission capacity, in contrast to other network- 
wide system performance metrics such as transport capacity, generally leads to closed-form 
expressions or tight bounds providing insight into the network design parameters [14|. To the 
best of our knowledge, little prior work on the network-wide performance of MIMO lA systems 
exists. In [ |15| , the spatial distribution of nodes is taken into account for deriving the point-to- 
point outage probability but the accuracy of the acquired CSI is ignored. By assuming perfect 
CSI, however, as discussed in [6. Section V], the authors in \l5j effectively favor lA over 
other transmission techniques which either do not require CSI at the transmitters or are less 
sensitive to CSI imperfections. Therefore, the goal of this paper is take into account both the 
node distribution and the CSI uncertainty to better understand the performance of MIMO lA in 
large decentralized networks. 



B. Contributions 

In this paper, we find the transmission capacity of a large ad hoc network where nodes 
are partitioned into separate clusters each cooperating through lA. We assume a four-stage 
transmission protocol. In the first stage, with a finite length training period, imperfect CSI 
for the cross links is obtained through MMSE channel estimation. In the second stage, the 
estimated CSI is fed back to the other nodes in the cluster during the feedback period. In the 
third stage, the lA transmit/receive filters are computed. In the last stage, using the rest of the 
finite-length channel block, the nodes communicate data using a cluster-wise slotted Aloha-like 
channel access protocol where at random, all nodes in a cluster either transmit simultaneously or 
turn off their transmission. MIMO lA, as discussed in this paper, requires synchronization and 
coordination in each cluster and therefore a coordinated decision to transmit or not is reasonable. 

Toward deriving the transmission capacity of this network, we first derive the exact point-to- 
point outage probability at a typical receiver. Then, assuming fixed feedback overhead, we solve 
for the optimum training period locally maximizing each cluster's goodput. Next, we derive the 
transmission capacity and compare it to a network with the same topology where only a single 
transmit/receiver pair in each cluster utilizes spatial multiplexing at each time instant. 

Our results indicate that the transmission technique of choice is a function of the node 
density, the mobility of the nodes, the transmit power, and the characteristics of the underlying 
communications medium. For example, in dense networks with high transmit power, spatial 
multiplexing (SM) over an orthogonal channel access strategy such as time division multiple 
access (TDMA) can outperform lA due to lower inter-cluster interference. Also, the signal-to- 
noise-ratio (SNR) switching point between lA and TDMA-i-SM decreases with increasing density 
and mobility. Our initial work in yj only deals with point-to-point outage probability. This paper 
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elaborates on the claims of [[T|, solves for the optimum training period, finds the corresponding 
transmission capacity of the network, provides easier to compute bounds in several important 
cases, and presents new simulation results. 

C. Organization and Notation 

The remainder of the paper is organized as follows. In Section [ll] we present the system 
model. In Section [In] we analyze the performance of intra-cluster MIMO lA through quantifying 



point-to-point outage probability and the transmission capacity. In Section IV we derive parallel 
performance metrics for the same network utilizing spatial multiplexing. We present numerical 
results in Section |V] followed by concluding remarks in Section VI Also, Tables |I] and |ll] present 



the notation used and the important symbols defined in this manuscript. 

II. System Model 

The spatial locations of the potential transmitters, are modeled as a planar Neyman- 
Scott cluster point process [16]. In this process, the cluster centers are modeled by a parent 
homogeneous Poisson point process (PPP) #p of density Ap. Each parent point x G $p forms 
the center of a cluster around which K daughter points are uniformly distributed in a circle of 
radius R. The resulting proces^is a stationary point process of density KXp. We assume clusters 
randomly access the channel with probability Pa effectively reducing the density of this PPP to 
Ap = -PAAp. The receiver of a transmitter at x is denoted by x and is assumed to be randomly 
located at distance from its transmitter forming an x MIMO link. The receivers are not 
part of the point process An instance of the nodes' location is shown in Fig. [T] 

In this paper, a typical transmitter (a transmitter chosen at random) is considered and its 
performance is analyzed. This transmitter is typical in the sense that the performance of lA in 
this node is a representative of the average performance of lA in the network [16J , [fTTl. Since 
the underlying point process is stationary, without loss of generality, it can be assumed that the 
typical transmitter is at the origiij^ Denote the cluster to which the transmitter at the origin 
belongs by The received signal at receiver x, x E ^o, is 

/g~H£^F^s^ + Oc + U£, (1) 

where 3c = Xlze#/*o v^^H^^F^s^ is the inter-cluster interference, gxz and H^^ represent the 
pathloss and the matrix of channel coefficients between the transmitter z and the receiver x, 
Fj, is the precoder at transmitter z with the transmitted signal such that E{s*s^} = P, 
and U£ ~ CK(0, Nol) is the additive white Gaussian noise. In this paper, it is assumed that 
F^Fz = I, because of tractability and the observation that the gain attained otherwise, such as 



with the MMSE algorithm in [19| or the Max-SINR algorithm in [18], is limited and confined 
to the low SNR regime where the inter-cluster interference is generally not dominant. In every 
cluster, channel state information is estimated at the receivers as in pO| and conveyed to all 



other nodes of the cluster using an error-free instantaneous feedback link. We propose to model 



the uncertainty in the MIMO channels using a Gauss-Markov model of the form [21 1, |22| 



= Jl-/3? H^, + x,ze^o, (2) 



'The parent points $p will not be a part of the final point process. 

^The whole point process can be translated so the randomly picked node is located at the origin. 
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where H^^ is the estimated channel, E^^ represents the estimation error with i.i.d. terms dis- 
tributed as C[Nf(0, 1), and is the normalized variance of the estimation error. It is assumed that 
the channel is quasi-static block-fading such that H is constant for a block duration of length T 
and then changes independently. Training, feedback, and data transmission are assumed to be all 
orthogonal in time, in the same coherence time or frame T [23]. Hence, /^^^ is set to be related 



to the average received SNR at each link, 7^.2, as [20 Section II.B] 



R-i = I = I (3) 



where 7o = ]^ and Tt > KN is the number of channel instances spent for training 11^2 [lOj. 
For analytical tractability, it is also assumed that H"" is used to construct the precoders/equalizers 
and nodes effectively ignore the imperfection in CSI in their design. 

III. INTRA-CLUSTER INTERFERENCE ALIGNMENT 

At each cluster, a K-user system of lA is feasible if there exists a set of matrices W = 
{W2I2; G ^0} such that, given the received signal of ([T]), the following constraints are met [5|: 

rank(W,H,.F.) = iV. ^ ^ ^ .4. 

where is the combining filter used at receiver x and Ng is the number of interference-free 
streams each transmitter can send to its receiver. The linear equalizer presented in [[6| and the 
projection matrix presented in [19, Section III.A] are examples of a possible receive filter in (Q. 



It is assumed that the lA precoders are designed using the alternating minimization algorithm 



in [19, Section III.A] such that F^; is independent of H^^ for all x E ^. Also, it is assumed 
that the set of {N, Ns, K} constitutes a feasible lA system, which for the MIMO interference 
channel requires that 2N - {K + l)Ns > 0. 

A. Characterizing the SINK 

From (|4]), interference at receiver x is confined to an — A^^ dimensional subspace. Let [{■}] 
represent horizontal concatenation of the elements in {■}. Then, as lA precoders/equalizers are 
constructed using H'" as given by ([2]), the Nx{K-l)Ns matrix of = [{H^^F^I^; x,z e *o}] 
spans an N—Ns dimensional subspace. Let the singular value decomposition of be Uj^Sj^V}. 
and let the rows of W^. be the columns of Uj^ corresponding to zero singular values in S j . . 
As is independent of H^^F^, it satisfies the conditions in Q and is a valid zero-forcing 
(ZF) equalizer for lA. Using this ZF receiver, the post-processing SINR of the nth stream at 
receiver x is 

lA ^ 9xx{^ — I^S:x)^*xx^xx 

~ + QxxPxx^xx^xx^ ' / J dxzPxz^xz^xz ~r 9xz"-xz"-xz 

^ y ^ y 



where for all 2; G h.xz = (eJ^W^H^^F^)*, e^^ = (e* W^E^^F^)*, and e„ is the nth column of 
an Ns X Ns identity matrix. Note that Ig represents the residual error from the direct link and, as 
the distance between the transmitter and receiver is constant, its pathloss (and the error variance) 
are not random variables and so it is separated from 1^ to emphasize this point. Let the entries 
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of Hxz and E^^ be i.i.d. Gaussian terms. As and are unitary matrices independent of 
Hxz and £^2, due to the doubly unitarily invariance of the Gaussian distribution, h^^ and e^^ 
will be column vectors of length A^^ with i.i.d. Gaussian terms and independent of each other (as 
Hxz and Eixz are independent of each other). Note that ([5]) is in fact independent of the stream 
index n. 



B. Probability of Outage 

In ([5]), since hl^h^z and el^e^z are i.i.d. T{Ns, 1) random variables, we denote them both by 
for notational simplicity. Denote the typical transmitter at the origin by o. Considering a 
transmitter at the origin is equivalent to conditioning on the existence of a point at the origin. 
Since every point belongs to some cluster, conditioning on the existence of a point at the origin 
equals the presence of a cluster with a daughter point at the origin. Since the parent point process 
is a PPP, an additional cluster with a daughter point at the origin can be added to it without 
changing the statistics of the other points of the process. Succinctly, the Palm probability of a 
Neyman-Scott cluster process is P° = P * where * denotes superposition [IT6|. This implies 
that assuming a point of the cluster process at the origin equals the original point process $ plus 
an additional cluster which has a point at the origin. Also this additional cluster at the origin is 
independent of the original process Since the tagged transmitter at origin does not contribute 
to the interference at the receiver, it is convenient to use the reduced Palm probability denoted by 
F-° instead of Palm probability. Reduced Palm probability is similar to Palm probability, except 
that the point at the origin is not considered in the computation of the probability and hence 
p!« = p * {^^ \ {o}}. From the probability of success is therefore Pl^{e) = P'°(76t > 0), 
where 9 is the SINR threshold. 

Theorem 1. For the system model described in Section |^ the success probability when each 
cluster uses lA is 



Ns-l . 



—rj) d _sNs 
e To 



N 



k=0 



s + 



N 



Lt{s)L,is) 



(6) 



s=r] 



where -^j" (s) and 'Ci-{s) are Laplace transforms of intra and inter-cluster interference given by 



K-l 



B{o,R) 

=exp - Xp /l- 




7ri?2 J \s + \\x—y\ 

- B(o,R) 



(7) 



(8) 



where rj 



9oo{l-l3lJ 

Proof See Appendix |A ■ 
Note that the integrals in (7]) and ([8]) can be computed by switching to polar coordinates and 
exchanging the order of differentiation and integratiorj^ Although the probability of successful 
transmission given by (|6]) is in closed-form, numerically computing it, especially for Ns > 

'The integral in (|7} is always finite since the domain of integration is a bounded set. The integral is ([8j can be shown to be 
finite using the fact that — y\\ > \\y\\ — R for the inner integral and 1 — a;™ < exp(— a;™), m > 1 for the outer integral. 
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1 where differentiation is required, is not trivial. Lemma [T] provides bounds on the Laplace 
transforms in (|7]) and ([8]) that are easily computable. Also, Lemma [2] can be used to avoid the 
differentiation in (|6]) when A^^ > L 



Lemma 1. For K > 2 



where 



^i{s)<— / ''^^ dxd^, (10) 



= <( " ^ 4fl^ (11) 

Otherwise 



an J Iy{a,b) is the regularized incomplete beta function. 

Proof: See Appendix |B} 
Lemma 2. If rj > {Ng — l)/e, the success probability is bounded by 



Ns-l k 

Ps^ < y ^e-(''-'=/")^Ee-(^'-^/")5'^°'''>^°£-° {t] - k/e)L^\7] - k/e). (12) 

k=0 



Proof: See Appendix |Cj 



For A^s > 1, un 
on Ng in Lemma 



ike (161), (fT2l) does not have a differentiation operator. Note that the condition 

I 1 — I 1 r eOD^ 

2 is equivalent to A^s < 1 + jzrt- which is true for practical A^^ and typical 

9 I — I 

lere 6^1, Dr > 1, and 13^^ ^ 1. Also, results from Lemma [IJ can be used 



operating regime w 

to further simplify the expression in Lemma |2j 



C. Optimizing the Training Period 

For a given block fading of length T, the transmitters spend Tt > KN channel instances for 
training the links. We also assume a prefect analog feedback link where the receivers send the 
trained channels over a period of Tf = K'^N channel instances to the transmitters. In practice, the 
transmitters select to optimize some performance criteria. In this paper, we assume transmitters 
use the goodput at each cluster defined as 

T _ 7^2 jY — T 

Tt = argmax '-KNsPl^{9)\og,{l + 9) 

Tt ^ (13) 



s.t. KN <Tt<T- K^N, 



where t-k^-Ti accounts for the transmission opportunities lost due to overhead, KNg is the 
total number of streams in each cluster, and Ps^(6') log2(l + ^^) is the rate multiplied by the times 
the connection exists, i.e. SINR passes the threshold 6. Note that in ( [T3| ), Ps'^(^^) is implicitly a 
function of the training period Tt. 
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For a given node mobility and hence a Doppler frequency ^ ^, the training period can be 
written as a fraction of the total block length T, i.e. Tt = 5T = S^. Therefore, the optimization 
problem of ([13]) can be rewritten as 

6 = argmax {1-6- fdK'^N)KNsFl^{e) log2(l + 6) 

5 

s.t. fdKN<6<{l-fdK'N), 

where 



(14) 



V -- 

by 



eDr°'{'yo5+NDr"fd) 
7o'5 




s=r] 



^+fd\\x-y-o\\ 

+fd{s+\\x-y-6\\°') 



-dx 



K-l 



dy , and -i^i/s) is given 



B{o,R) ^ 

With a convex relaxation on Tt (and therefore 5) to change its domain to the real numbers, 
the optimization problem of ([14]) is convex and solvable with any of the numerical optimization 
algorithms [ |24| . Note that, although complicated, the derivative of the objective function in ( [T4[ ) 
w.r.t 5 is computable and evaluating the objective function or its derivatives for any set of values 
is possible. Next we provide approximate closed form solutions for some common cases. 

1) Single stream from each transmitter: When Ns = l, Pg^ in ( [14] ) simplifies to 



e To 



(?7o 
N 



N 



+ fdiV + D^r 



(15) 



The highly non-linear dependency of ( [15] ) on 5 can be converted into a polynomial one following 
the Taylor expansion approximation method proposed in [25]. Let g{S, fd) be the objective 
function in ( [T4] ). Rewrite g{S, fa) as its Taylor series around fd = (infinite block length and 
hence perfect training) keeping all the other variables constant 
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argmax g{6, = 0) + ^\fd=ofd - 
s OJd 

s.t. fdKN<6<{l-fdK^N). 



2dl 



2 \fd 



=ofd 



(16) 



Simplifying the terms in ( [T6| ) and removing the constant scaling coefficients from the optimization 
problem yields 



Si 



argmax 

s 



6 + Cil + C2-^ 



(17) 



s 52 

s.t. fdKN<6<{l-fdK^N), 

where Ci and C2 are given in Appendix [D] Let Si he the relevant root of the first derivative of 
( [17] ). The optimum training period will be given by 

ft,i = min (max {KN, [61T]) , T - K^N) . (18) 

When "^gdolo = ^ which is true for high SNR, ~ which implies that r] ^ 
In this case, ^ilj]) is a strictly positive function independent of Tt and ( [17] ) simplifies to 

82 ~ argmax — 5 



argmax 

5 



Di\ + D2^ 



5 ' "'52 
s.t. fdKN<6<il-fdK^N), 



(19) 
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where Di and D2 are given in Appendix |e| Similar to 1, let 82 be the relevant root of the 
first derivative of (fTO)). The optimum training period will be given by 



f;,2 = min (max {KN, [62T]) , T - K^N) . 



(20) 



2) Greater than one stream from each transmitter: For Ng > 1, the same approach taken 



to derive ( [T8| ) and pO| ) can be used to approximately solve for the optimum training period. 
Alternatively, bounds given in (|9]), ( flQ] ), and Lemma |2] can be used to derive simpler (and less 
accurate) results. We believe the involved expressions in this case do not provide additional 
insight into the problem at this point and are not presented here. 



D. Transmission Capacity 

Let g(Ap) = 1 — Ps^(6') = e. Accounting for overhead, the normalized transmission capacity 



IS 



T - K^N - Tt 



T 



(21) 



With the probability of successful transmission as given by ([6]), the exact expression of ^"^(e) 
for general Ns is not analytically tractable. Next we give its exact expression for the case of 
Ns = I and provide a bound for general A^^ > 1- 
1 ) Single stream from each transmitter: Using ([6]) 



1 -v^ 
1 — e T° 



S + 



N 



(22) 



where -f) = r]\rp^^f^. Substituting ^ and ([8]) into ([22]) gives 
log, 



1— e Zt7o_L-_i_na 7r_R2 



q-\e) = K 



I 

B{o,R) 



7rR2 / ( 
B{o,r) 



%^ + ||x-y-o||" 

v+^+\\^-y-o\\ 



Ns 



-1 K-1 ^ 



dx 



dy. 



/I- 



B(o,Ry 



\x—y\\' 



'\\^-y\\ 



dx 



K 



dy 



(23) 



where is the optimum training period obtained earlier. Using ( [23] ), ( |2T] ) can be computed. 
2) Greater than one stream from each transmitter: When t) > (Ng — l)/e, using Lemma [2| 



Ns-l .fc 



(24) 



fc=0 



Note that the expression for the optimum training when A^^ > 1 was not presented in Section 



in-C2 Now, if 57 > A;/e for all A; G {0, . . . , A^^ - 1}, ^7 - ^/e ~ 57 and hence ([24]) equals 

Ns-l . 



g(Ap) > 1 



k=0 



k\ 



1 - e 
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loge 



1 

1-e 



2^k=0 k\ 



/I- 



-tR2 / 



77+||2:-{/|| 



(25) 



To simplify the integrations in ( [25] ), one could also use the bound given by ^ for (f}) or 
use a similar approach taken to obtain ([9]) to find an appropriate bound for the denominator of 



IV. Spatial Multiplexing 

An alternative strategy to lA is spatial multiplexing with an orthogonal multiple access 
technique, e.g. TDMA. We analyze open-loop spatial multiplexing where at each cluster only 
a single transmit/receiver pair communicate N streams without precoding. In short, we refer to 
TDMA+SM as SM. For a fair comparison with lA, we assume a similar CSI imperfection as in 
(|2]). Therefore, the signal at a typical receiver can be written as 



yx = gxx y\J {l-Pl^)gxxii^x+Pix^xxj S^+Jc+U^, 

where 3c is defined in ([T]) with the difference that each cluster only has a single transmitter. 
In this case, the point process of the transmitters simplifies to the Poisson point process of the 
parent points. When /J^^ = 0, after a zero-forcing receiver based on H^.^, the SINR of the nth 
stream at a typical receiver can be written as 

7f,^ = 7 '-^ ^ , (26) 

where we have assumed the interference values from different transmitters are independent. The 
probability of success is given by [ |26J 

pSM(^) = P'"(7f,^ >e)= exp {-\elD',S - 9D:^^ , (27) 

7rr ( A^-|- — jrf 1 — ) 

where 3 = — — t{n) — ■ With imperfect CSI, we assume the receivers compute their zero- 
forcing receivers based on the estimated channel values and ignore the estimation error. Then, 
the denominator of (|26l) will have an additional term of gxxl^1x^xx^*xx inside the parenthesis. As 



the channel estimation error is independent of the estimated channel and the transmitter/receiver 
distance is fixed to D^, this additional term is effectively increasing the noise spectral density 
from No to + Pg^^P^x- numerator of (|26]) also changes to g^x (1 — /^L), in case of 

imperfect CSI as in (|2]), the probability of success at a typical receiver for any of the streams 



changes from ( [27] ) to 



pSM(^) = F'-i^l^ >e)= exp I -X,e-^D^d - ~eD:^ I , (28) 



where 9 = and No = No + Pgxxl^l^- 
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A. Optimum Training Period 

Similar to the lA case, each node selects a training period that optimizes its own (its own clus- 
ter's) goodput. Unlike lA, however, here the minimum training period for each transmit/receive 
pair is and no resources are spent for feedback. Hence 

rs^ = argmax NVf' {9) \og,{l + 9) 

Tt J (29) 

s.t. N <Tt< T. 



It is possible to show that the optimization problem of (|29]), after a convex relaxation on Tt, is 
also a convex problem and any of the numerical optimization algorithms [24] can be used to 
solve for the optimum Tf ^. Nevertheless, similar to the optimization problem of ([14]), we first 



replace the objective function of ([29]) with its second order Taylor expansion around fd = 
(infinite block length and hence perfect training) holding all the other variables constant, and 
then obtain an approximate closed-form solution for 

S^^ = argmax (1 - 6)NPf^{9) \og^{l + 9) 

(30) 

s.t. N <Tt<T. 

Removing the constant coefficients and simplifying the terms, the new optimization problem 



derived from the 2nd order Taylor series expansion of the objective function in (30) is 



^argmax - 6 + Bil + 

s 0^ (31) 

s.t. fdN <6< 1, 

where Bi and B2 are given in Appendix [F] Similar to the lA case, let ^3 be the relevant root of 



the first derivative of ( 131] ). The optimum training period will be given by 

ff^ = min (max {N, [63T] ) , T) . (32) 

B. Transmission Capacity 

For a given block length T, accounting for overhead, the normalized transmission capacity is 

rp _ n^SM 

c{e) = —^N\lil-e). (33) 



Using ( ]28] ), Xp is found as 



— logf^V^"^" 



K = 1^:^-^. I log I ^ : ) ^ 1 ' (34) 



where in computing 9 and No, the optimum training period, Tf^, is used 
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V. Numerical Examples and Discussion 

In this section, we present numerical examples demonstrating both the accuracy of the devel- 
oped theory for analyzing large lA networks and the effectiveness of such analysis in discovering 
operating regimes where lA outperforms other transmission schemes. For simplicity, we focus 
on the case of K = 3, N = 2, and Ns = l. To further reduce the number of involved variables, 
we set R to Dc = 0.5\p^ (average distance between the cluster centers) and set Dj. to ^. In this 
way, by increasing Dc, all the nodes move away from each other. We set No to 1 and unless 
otherwise stated, it is also assumed that all the clusters are transmitting simultaneously. In the 
forthcoming discussions, perfect channel estimation corresponds to very large values of Tt where 
(3 is assumed to be and the worst channel estimation corresponds to assigning the least number 
of channel instances required for training the cross links in an lA cluster, i.e. Tt = KN. Note 
that for perfect CSI, ([38]) and are both equal to 1. 



Probability of successful transmission with perfect CSI Assume perfect channel state 
information and 7o = 30 dB. P^"^ and Pf^ given by @ and ([27]) as a function of the SINK 



threshold for three values of the average distance between the cluster centers in shown in Fig. 
[2] As can be seen, for very dense networks, SM outperforms lA which can be explained by 
the reduced inter-cluster interference due to only a single transmit/receiver pair being active at 
each cluster. In addition, for moderately dense networks, lA outperforms SM only at high 9 
which highlights the nonlinear nature of such comparisons. For very low node density, lA has 
better performance than SM when the increased distance between transmit/receive pairs does 
not enforce a zero probability of successful transmission. 

Probability of successful transmission with imperfect CSI Now assume the worst channel 
estimation error variance for lA with the same 7o = 30 dB. The questions of interest are i) how 
much the performance of lA will be affected by introducing the imperfect CSI and ii) how does 
the relative performance between lA and SM change in this scenario. For the same values of Dc 
and 9 as in Fig. [2] the probabilities of successful transmission for lA and SM when the training 
period is lowered to KN = 6 is shown in Fig. |3] P^^ and P^^ remain relatively unchanged for 
dense networks but are reduced for moderate and high node densities. Compared to the perfect 
CSI case, SM now outperforms lA for a larger range of 9 for moderate node densities. Also, lA 
has lost most of its advantages at moderate node densities (from maximum of 0.1 to 0.06) and 
the gap between lA and SM has decreased for the widely dispersed network. As a reference, 
the error of ([6]) when Tt = 6 compared with numerical results for the curves shown in Fig. |3} 
in the worst case, is less than 0.0035. 

Optimum training period Again, assume 70 = 30 dB. The optimum training period as 



obtained using ( [18] ) and ( 132] ) for lA and SM together with the corresponding optimum values 



found through numerically optimizing ( ]T3] ) and ( 129] ) for 6* = 20 dB and two values of cluster 
radius (and hence average cluster center distance) as a function of total frame length, T, is shown 
in Fig. |4] As can be seen, ([T8]) and ( 132] ) can be used to accurately find the optimum training 



period for a wide range of node density and total frame length. As expected, lA requires more 
training than SM but surprisingly, the ratio of the optimum training periods between the two 
transmission techniques seems to be a constant value independent of the total block length, T, 
which requires further analysis not in the scope of this work. Also, as the Taylor expansions 
used to derive (]4]) and ( fT8] ) are at T = 00, small inaccuracies at small frame lengths is expected. 
Moreover, the numerical errors in numerically calculating the integrals of the coefficients in ( fTSj ) 
become harder to confine as R (and hence the integration ranges) increases which explains the 



deviation of the results obtained using (18) from the true optimum points for R = 5. 
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Transmission capacity Now fix the SINR threshold in ([23]) and ([34]) to = 17 dB and fix the 



maximum tolerable outage probability to e = 0.1. The maximum cluster density, Ap, as given 



by ( [23| ) and p4j and the corresponding transmission capacities given by ( [22] ) and ( [33] ) for two 
values of cluster radius, R, and two values of the total block length, T, as a function of transmit 
SNR, 7o, are shown in Fig. [5] and Fig. |6] Note that for i? = 5 only T = 200 curves are shown as 
T = 1000 curves were not conveying any new observations. Also, for both lA and SM, optimum 
training periods are used to compute the transmission capacity. As seen in Fig. |5} except for 
the high SNR values, increasing T allows for a better channel estimate at the receivers and 
consequently each receiver can tolerate more inter-cluster interference and hence a large cluster 
density. Also as the radius of each cluster increases, the channel estimation errors increase and 
the maximum tolerable cluster density decreases. Although by increasing T, the crossing point 
between lA and SM for Ap shifts to a lower SNR value, the corresponding crossing point for 
the transmission capacity shifts to higher SNR values which highlights the tradeoff between 
a higher obtainable throughput by increasing K in each cluster and the accompanying higher 
accumulated non-aligned interference at each receiver due to the imperfect CSI estimation. 

VI. Conclusion 

In this paper, we obtained an expression for the probability of successful transmission in 
a clustered MIMO lA network when the communication takes place over a Rayleigh fading 
quasi-static block-fading channel of a finite length and the impact of channel estimation on the 
accuracy of the obtained CSI is taken into account. We formulated an optimization of the training 
period length and explicitly solved it for the case of a single stream from each transmitter. We 
then presented the exact transmission capacity for the same case of a single stream from each 
transmitter and also provided an upper bound for the general case. Through simulations, we 
showed that probability of successful transmission and transmission capacity of TDMA-i-SM can 
be higher than lA in dense networks or when the mobility of the nodes is high. 

Our analyses in this paper were based on a generic lA filter design approach where the only 
goal of the transmitters is to align the interference without optimizing the direct links. Any 
such variations of the lA algorithm can potentially increase the gains of lA but quantifying 
such transmission techniques are out of the scope of this work. Moreover, we assumed a perfect 
analog feedback link for lA where only the minimum required channel resources are used to 
relay the exact estimated channel values at the receivers to the rest of the nodes in each cluster. 



By using the results of [10|, it is possible to include the analog feedback analysis but doing so 
would result in more involved expressions not benefiting the current scope of this work. Other 
optimization problems, such as optimizing the SINR threshold for the maximum tolerable outage 
probability, can be readily defined using the results of this paper. 

Appendix A 
Proof of Theorem [H 

Since hoo^^{Ns, 1), its complementary cumulative distribution function is F{x) — ^^'^"^'^ — 



e-Eto'i- Hence 

^'+/, + /, + /, ' I - V ""' 'V7o 



7o 
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N,-l 
k=0 



7o 



k=0 



(35) 
(36) 



s=ri 



where (a) follows from distribution of hso and (b) from the derivative property of the Laplace 
transform. Since Is, h, and li in ([5]) are all independent, ( |36l ) equals 



T7 ^ H iV 



k=0 

E 

fc=0 



A;! ds'^ 

-7])'' d'^ .^JVs 



s=rj 



k\ ds'^ 



(37) 



s=r) 



where (s) is the Laplace transform of the intra-cluster interference w.r.t the reduced Palm mea- 
sure and 'Ci^{s) is the Laplace transform of the inter-cluster interference. Since hoo ~ T{Ns, 1), 
using (|3]), 



TV 



(38) 



We now evaluate 'C'f (s) 



E!° 



1 



n I 



E'" 



n 



E'' 



Tt7o 



e _|_ Ikh. _|_ II r — ol 



(39) 



where (a) follows from the Laplace transform of h^z and in the last step ||a;||^" was substituted for 
the path loss. Observe that (|39]) is the probability generating functional (PGF) of the representative 
cluster w.r.t its reduced Palm probability. We now use the following result from [17, Lemma 
1] which we state for completeness. For any function f{x) : — t- IR+, 

n /(^) = i /" f i f /(^ - (40) 



A=B(o,R) 



A=B{o,R) 



Using (|40]), (|39]) equals 



S(o,_R) L B{o,Ry 



^+iix-7/-dr 

,s + ^ + ||x-?/-6|l" 



-1 K-l 



dx 



dy. 



(41) 
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We now focus on /Ci^(s) 



n I 



(a) 



exp - Xp 1 



1 



+ s X — 6 



■ B{o,R) 



N, 

dx 



dy 



(42) 



where (a) follows from ( [43] ) which characterizes the PGF of a Poisson cluster process |16| 

fix-y)dx] dy\. (43) 



El[f{x)=expl-\p / l- 



i?2 

B{o,R) 



Note that in (41 1, x and y vary within a cluster, while in (42), y varies over the whole plane. 



Also, in (41 1, the interference is only from the other K~l transmitters in the cluster and hence 



the exponent of K— 1, while in ( |42| ), the interference is from all the K transmitters at each 
cluster and hence the exponent of K. 



Appendix B 
Proof of Lemma [I] 



Define the following PDF 



^ xeB{o,R) 
Otherwise 



(44) 



Now, using ( |44| ) 

£i^(s) = exp I -Xp I 1 



-sh\\x—y\\ " 



]da; 



B{o,R) 



K 



dy 



exp -Xp I 1 



X 



K 



(45) 



Therefore, the inner integral in ( [45] ) can be seen as the expectation of E/j[e ^^'^^'■^ ^" "] w.r.t fp{x) 
Using Jensen's inequality, as (E^-f^])^ < E^[2^] for > 2, it follows from ( |45| ) that 



£i^(s) < exp ( -Xp 

(a) I 

= exp -A. 



1 - 



/p(x)E,[e-^'*ll^-'^ll""]^dxdi/^ 



1 



\x - y\ 



exp -Xp / fp{x 



s + \\x — y\\°' 

i\\a 



KNs 



fp{x)dxdy 



1 



\y 



exp 
exp 



-Xps^' 



s + lly'll" 

^T{KN, + 2/a)T{l-2la) 

T{KN,) 
^T{KN, + 2/a)T{l-2/a) 



dy'dx 
fp{x)dx 



r{KNs 
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where (a) follows from h ~ T(Ns, 1). Similarly for L'f 



1 



B{o,R) 



K-1 



1 



ttR^ 

B{o,R) 



(a) 

< E 



x,y 



{Eh[e 



-shf{\\x-y~6\\) 



]) 



K-1' 



(46) 



1 



11^11"+^ 



X and y are two points independently and uniformly distributed in 



where /(x) 

B{o,R), and (a) follows from the Jensen's inequality. Evaluating the expectation w.r.t h, ( |46l ) 
equals 



1 



(47) 



L(l + s/(||x-y-6||))^^(^-i) 

Let (r, 6') be the polar representation of x — y, where x and y are two points independently and 
uniformly distributed on B(o,r). Then 9 ~ f/(0,27r) and from [ ^27J the PDF of r is given by 
([11]). Note that ^ equals J ,2 (3/2, 1/2) = f (arcsin( ^^1 - x74i?2) - jl^I-x-^/ArA. 
Without loss of generality, we can assume that = {Dj.,0). Then ( |47| ) simplifies to ( flOl ). 

Appendix C 
Proof of Lemma |2] 



From ( [35] ) we have 
P 



Ns-l k 

iA__y V_^io 

k=0 

Ns~l I, 



+/,+/,+ /,V 



k=0 



7o 



+ L + L + Ii 



k=0 

Ns-l 



^Q-iv-''/^)^^^'{v-k/e)Is-^\o^-{r,-k/e)I,-^^-iv-k/e)I, 



k=0 



(48) 



where (a) follows from the observation x e < 1 for x > and the proof follows by 



comparing (48) to (37) and making the necessary change of variables. 



Appendix D 
Coefficients of ( [T7[ ) 



^^l^f ( - 2eD^'^f,KN^, (A4KA,7o + 1) 

' o 

+ 272 {-AMK - 1)N + K^, {f.K'N - 1)) 
- D,"7o (2 + 2A4irAp7o + fdN (2 + K{K + 1) 
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where 
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1 nk-i/ii' 
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= (ir - 1) 
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Appendix E 
Coefficients of ([19]) 

-^-mO + l) :^{K-l){K+2)-{K-l)A, 

N V 



N 



D2 = I D:{e + l) + ^{K-l){K + 2) + {K- 1)A, ) , 

where Ai is given in Appendix p] 



Appendix F 



D^fd 



Coefficients of (31 ) 



7o« 



B2 = (^{aNe (7o + D^) + 2-fJd^DXj - 2-fla3e^D% {2/a - 1) 



where d is defined in (27) 
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TABLE I 
Notation used in this paper 



Symbol 


Explanation 


a & A 


a is a vector and A is a matrix 


A*, A^, & A-i 


Conjugate transpose, transpose, and inverse of A 


Aj^ X m 


A is a matrix of dimension n x m 


A(n, m) 


Element on the nth row and mth column of A 


rank (A) 


Rank of A 


[a] 


Closest integer to o 


IjV & Oaxb 


N X N identity matrix and a x b matrix of all zeros 


B{o,R) 


The disk of radius R centered on the origin 




The reduced Palm probability measure 


Aij vs. A 


When Aij correspond to i and j nodes, A is a generic random variable i.i.d. to the specific ones 



TABLE II 

Important symbols defined in this paper 



Pa 


Channel access probabiUty of each cluster 


*P 


Spatial locations of the cluster centers with density Ap 


K 


Number of transmit/receive pairs in each cluster 


N 


Number of antennas at each node 


Ns 


Number of streams from each transmitter 




The spatial locations of the potential transmitters with density K\p 




The cluster to which the transmitter at the origin belongs by 


Ap 


Density of the active transmitters (equals PaAp) 




Distance of a receiver from its corresponding transmitter 




Average distance between the cluster centers 


R 


Radius of each cluster 




Variance of the channel estimation error and pathloss for the link between transmitter z and receiver x 


T, U, Tu&Tt 


Length of the constant channel block, Doppler frequency (fa ~ training period, and feedback period 


e 


SINR threshold for successful transmission 
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Transmitter Location 
Typical Receiver 
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Fig. 1. An instance of tlie transmitter's distribution when tlie number of transmitters per cluster is K = 3. 
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SINR threshold (dB) 



Fig. 2. The probability of successful transmission for lA and TDMA+SM as a function of the SINR threshold for some values 
of the average distance between the cluster centers Dc with perfect CSI. 
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SINR threshold (dB) 

Fig. 3. P'* and Pf^ as a function of the SINR threshold for some values of the average distance between the cluster centers 
Dc with the smallest training period of Tt — 6. 




Frame length 



Fig. 4. The optimum training period as obtained using jl8| l and \i2\ for lA and SM together with the corresponding optimum 
values found through numerically optimizing ^13) and l |29[ ) at two values of cluster radius (and hence average cluster center 
distance) for 6 = 20 dB and 70 — 30 dB as a function of total frame length, T. 
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SNR(dB) 



Fig. 5. The maximum cluster density, Ap, as given by l|23j and for two values of cluster radius, R, and two values of 
the total block length, T, at the SINR threshold of 6 = 17 dB and the maximum tolerable outage probability of e = 0.1 as a 
function of transmit SNR, 7o. 
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Fig. 6. The transmission capacity, c(e), as given by \22\ and l[33j for two values of cluster radius, R, and two values of the 
total block length, T, at the SINR threshold of S = 17 dB and the maximum tolerable outage probability of e = 0.1 as a 
function of transmit SNR, 70. In each case, the optimum training periods as given by ([TsJ and \32\ were used to compute the 
transmission capacities. 



